Beyond histograms: why learned structure-preserving descriptors outperform HOG

نویسندگان

  • Thomas Guthier
  • Volker Willert
  • Julian Eggert
چکیده

Statistical image descriptors based on histograms (e.g. SIFT [1], HOG [2]) are widely used in image processing, because they are fast and simple methods with high classification performance. However, they discard the local spatial topology and thus lose discriminative information contained in the image. We discuss the relations between HOG and VNMF descriptors, i.e. structure free histograms versus learned structure-preserving patterns. VNMF is a shift-invariant, sparse, nonnegative unsupervised learning algorithm [8, 9, 5], that provides a distinct decomposition of the input into its parts. The VNMF descriptor outperforms the statistical HOG descriptor, because it preserves spatial topology leading to better classification results on real-world human action recognition benchmarks [11, 12].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Histograms of Oriented Gradients for 3D Object Retrieval

3D object retrieval has received much research attention during the last years. To automatically determine the similarity between 3D objects, the global descriptor approach is very popular, and many competing methods for extracting global descriptors have been proposed to date. However, no single descriptor has yet shown to outperform all other descriptors on all retrieval benchmarks or benchma...

متن کامل

’Histograms of Oriented Gradients for Human Detection’ versus ’Fast Human Detection Using a Cascade of Histograms of Oriented Gradients’

Dalal and Triggs [1] studied the question of feature sets for robust visual object recognition. They first considered existing edge and gradient based descriptors and then they showed experimentally that grids of Histograms of Oriented Gradients (HoG) descriptors significantly outperform existing feature sets for human detection. After this they studied the influence of each stage of the comput...

متن کامل

One-Shot-Learning Gesture Recognition Using HOG-HOF Features

The purpose of this paper is to describe one-shot-learning gesture recognition systems developed on the ChaLearn Gesture Dataset (ChaLearn). We use RGB and depth images and combine appearance (Histograms of Oriented Gradients) and motion descriptors (Histogram of Optical Flow) for parallel temporal segmentation and recognition. The Quadratic-Chi distance family is used to measure differences be...

متن کامل

Real-Time Visual Tracking through Fusion Features

Due to their high-speed, correlation filters for object tracking have begun to receive increasing attention. Traditional object trackers based on correlation filters typically use a single type of feature. In this paper, we attempt to integrate multiple feature types to improve the performance, and we propose a new DD-HOG fusion feature that consists of discriminative descriptors (DDs) and hist...

متن کامل

One-shot-learning Gesture Recognition Using Hog-hof Features Bachelor Thesis One-shot-learning Gesture Recognition Using Hog-hof Features Bachelor Thesis Názov: One-shot-learning Gesture Recognition Using Hog-hof Features

The purpose of this thesis is to describe one-shot-learning gesture recognition systems developed on the ChaLearn Gesture Dataset [3]. We use RGB and depth images and combine appearance (Histograms of Oriented Gradients) and motion descriptors (Histogram of Optical Flow) for parallel temporal segmentation and recognition. The Quadratic-Chi distance family is used to measure differences between ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014